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The study of properties of mean functionals of random probability measures is an important 
area of research in the theory of Bayesian nonparametric statistics. Many results are now known 
for random Dirichlet means, but little is known, especially in terms of posterior distributions, 
for classes of priors beyond the Dirichlet process. In this paper, we consider normalized random 
measures with independent increments (NRMI's) and mixtures of NRMI. In both cases, we are 
able to provide exact expressions for the posterior distribution of their means. These general re- 
sults are then specialized, leading to distributional results for means of two important particular 
cases of NRMI's and also of the two-parameter Poisson-Dirichlet process. 
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1. Introduction 

Bayesian nonparametrics has recently undergone major development which has led to the 
proposal of a variety of new classes of prior distributions, as well as allowing the concrete 
application of nonparametric models to problems in, for example, biology, medicine, 
economics and ecology. While there is a vast literature on computational issues related 
to Bayesian nonparametric procedures, there is a dearth of analytical results, mainly due 
to the difficulties of studying distributions on infinite-dimensional spaces. Indeed, given 
a nonparametric prior, a natural statistical object to analyze is the mean functional: 
for instance, in the context of survival analysis, the mean takes on the interpretation of 
(random) expected lifetime. However, such an analysis seemed to be prohibitive until the 
pioneering contributions of Cifarelli and Regazzini [4, 5] who set up a general theory for 
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the study of Dirichlet process means and also derived the remarkable Markov-Krein or 
Cifarelli-Regazzini identity. Since then, much attention has been focused on means of the 
Dirichlet process. Among other contributions, we mention [7, 8, 11, 13, 16, 17, 19, 31, 39, 
43]. Recently the first results concerning nonlinear functionals of the Dirichlet process, 
such as the variance functional, have appeared in the literature (see [6, 34, 39]). Another 
line of research has dealt with mean functionals for priors different from the Dirichlet 
process (see [10, 14, 22, 33, 40, 42]). The study of means also highlights the interplay 
with other areas of mathematics, such as special functions [31], excursion theory [22, 42] 
and mathematical physics [38]. 

While some results concerning the prior distribution of means for classes of priors more 
general than the Dirichlet process are known, no exact result is known for their posterior 
distribution. Indeed, in [40], normalized random measures with independent increments 
(NRMFs), whose construction is recalled in Definition 2.1, were considered: in addition to 
results for the existence and the exact prior distribution of their means, an approximation 
for their posterior density were achieved. These results were then extended in [33] to 
mixtures of NRMI, leading to an approximation of the posterior mean of mixtures of 
NRMI and to the exact expression for the special case of the mixture of Dirichlet process. 
These two papers [33, 40] represent the starting point of our work and we aim to develop 
and complete these results: indeed, we are able to provide exact expressions for the 
posterior distributions of both means of NRMFs and means of mixtures of NRMI. 

The outline of the paper is as follows. In Section 2, the basic concepts are introduced 
and preliminary results recalled. In Section 3.1, we determine exact expressions for the 
posterior distribution of means of NRMFs, whereas in Section 3.2, general formulae for 
posterior means of mixtures of NRMI are obtained. Section 4 is devoted to the study of 
means of particular NRMFs of statistical relevance, namely the extended gamma NRMI 
and the generalized gamma NRMI, and the main result of [33] is recovered as a corollary. 
Moreover, our results for the generalized gamma NRMI are exploited to derive a new 
expression for the distribution of a mean of the two-parameter Poisson-Dirichlet process. 
Proofs arc deferred to the Appendix. 

2. Preliminaries and basic definitions 

We first recall the concept of completely random measure, due to Kingman [26]. Let 
(X, ^) be a Polish space endowed with the Borel cr-ficld and (M,^(M)) be the space 
of boundedly finite measures on X, with ^(M) denoting the corresponding Borel cr- 
algcbra. Let p, he a random element defined on (ri,^,P) and with values in (M, i^(M)) 
such that for any collection of disjoint sets in Ai, A2, . . . , the random variables 
/i(Ai), /2(^2), • ■ • are mutually independent. Then, /2 is a completely random measure 
(CRM) on (X, A CRM can always be represented as a linear functional of a Poisson 
random measure. In particular, define J^^, to be the space of measurable functions : X — > 
IR+ such that /jj+^5j[l -e-"''(^)]i/(dw,dx) < 00, where v stands for the intensity of the 
Poisson random measure underlying fx, which must satisfy the integrability condition 
Jju^, min{i;, l}iy{dv,dx) < +00 for almost all a; g X. Then, p, is uniquely characterized by 
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its Laplace functional which, for any h in Jif^,, is given by 




(1) 



Throughout the paper, we define, for any real- or complex-valued function g defined 
on X, the functional ip{g) := /jj+xx[l - e~'"^^''^iy{dv,dx). Moreover, let 1:X^R be the 
function identically equal to 1, namely l(a;) = 1 for any x in X. See [28] for an exhaustive 
account on CRM's. The representation in (1) establishes that p, is characterized by the 
corresponding intensity v. Letting a be a cr-finite measure on X, we can always write 



where p is a measurable kernel such that p{-\x) is a cr-finite measure on ^(M+) for any 
X in X. Such a disintegration is ensured by [25], Theorem 15.3.3. If (2) simplifies to 



then we say that fi is a homogeneous CRM, whereas if this is not the case, then /2 will 
be termed a non-homogeneous CRM. In the following, we will assume a is non-atomic. 

Since the aim is to define random probability measures by means of normalization of 
completely random measures, the total mass /u(X) needs to be finite and positive, almost 
surely. As shown in [40], this happens if i^(R+ x X) = +oo and the Laplace exponent 
■0(A) < -f oo for any A > 0, respectively. 

Definition 2.1. Let fl be a CRM on (X, ,2') such that i'(R+ x X) = +oo and ip{X) < +oo 
for any A > 0. The random probability measure 



is then termed an NRML 

Strictly speaking, the random probability measure in (4) is a normalized CRM and 
reduces to an NRMI when X = K. Nonetheless, we prefer to keep the acronym NRMI 
introduced in [40]. According to the decomposition of the intensity v described in (2) 
and (3), we will distinguish between non- homogeneous and homogeneous NRMFs. Several 
priors used in Bayesian nonparametric inference can be defined as in (4). For instance, 
as already noted by [12], the Dirichlet process can be recovered as an NRMI based on 
the gamma CRM for which i^{dv,dx) = v~^e~'" dv a{dx). Other examples include the 
normalized stable process [27] and the normalized inverse Gaussian process [29]. 

Nowadays, the most common use of Bayesian nonparametric procedures is within hier- 
archical mixtures: letting Y be a Polish space equipped with the Borel cr-algebra one 
defines a random density (absolutely continuous with respect to some a-finite measure 
A on Y) driven by a random discrete distribution, that is. 



i^(dw,dx) = p{dv\x)a{dx) 



(2) 



v{dv, dx) = p{dv)a{dx), 



(3) 




(5) 
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where fc is a density function on Y indexed by some parameter with values in X. A 
typical choice for k is represented by the density function of the normal distribution: 
in such a case. P controls the means (and possibly also the variances) of the random 
mixture density. This approach is due to Lo [32], who defined a random density as in (5) 
with P being the Dirichlet process; this model is now commonly referred to as mixture 
of Dirichlet process (MDP). Recently, various contributions have focused on replacing 
the Dirichlet process in (5) with alternative random probability measures, which yield 
interesting behaviors, especially in terms of the induced clustering mechanism; sec, for 
example, [18, 20, 29]. 

The present paper is focused on linear functionals of NRMI's, namely on random 
quantities of the type P{g) /^j g{x)P{dx) , where g : X — > R is any measurable function 
such that 

ipit\g\)^ [ (l-e-*^lf(^)l)p(dw[x)a(d.T) <+oo > 0. (6) 

By [40], Proposition 1, condition (6) is necessary and sufficient for P([(?[) to be a.s. finite. 
In the sequel, we always assume that (6) holds true. An exact analytic expression for F, 
the distribution function of P{g), is given in [40], Proposition 2. We will also examine 
means of a mixture of NRMI, that is 

Qig) / giy)mX{dy) = / h{x)Pidx) = P{h), (7) 

where Q stands for the random probability measure associated with /, defined as in 
(5), and h(x) = g{y)k{y,x)X{dy). Hence, as shown in [33], the necessary and sufficient 
condition for Q{g) being a.s. finite becomes 

ipith*):^ [ (l-e^*'''''(^))p(di;[a;)a(da;) <+oo Vi > 0, (8) 

with h*{x) = Jy \g{y)\k{y,x)\{dy). The evaluation of the prior distribution of the mean 
then follows in a straightforward way [33], Proposition 2. In the following, when we 
consider means of mixtures of NRMI, as in (7), we will tacitly suppose that g verifies 
condition (8). 

3. Posterior distribution of means 
3.1. Means of NRMI's 

We first focus attention on posterior distributions of means of NRMI's. Let {Xn)n>i be 
a sequence of exchangeable observations, defined on (r2,^,P) and with values in X, such 
that, given an NRMI P, the XiS are i.i.d. with distribution P, that is, for any Bi G ^ , 
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1 , . . . , n and n>l, 



'[Xi e Bi, . . . , x„ e B„|F] = J] P{Bi). (9) 



Moreover, let X = {Xi , Xn). It is clear that one can always represent X as (X* , tt) , 

where X* = (X^, . . . , X*^^^^^) denotes the distinct observations within the sample and tt = 

{Ci, . . . , C„(^)} stands for the corresponding partition of the integers {1, . . . , n} recording 

which observations within the sample are equal, that is, Cj = \i: Xi — X*}. The number 
of elements in the jth set of the partition is indicated by uj, for j — 1, . . . ,n{Tv), so that 

En(7r) 
jU rij = n. 

At this point, it is useful to recall the posterior characterization of NRMFs given in [23] . 
For any pair of random elements Z and W defined on {il, J^, P), we use the symbol Z^^) 
to denote a random element on (fi, P) whose distribution coincides with a regular 
conditional distribution of Z, given W. Now, introduce a latent variable, denoted by J7„, 
whose conditional distribution, given X, admits a density function (with respect to the 
Lebesgue measure on R) coinciding with 

n(7r) 

/^Ju) cxu"-i n T„,(w|X;)e-'^("), (10) 

i=l 

where 

tMX*)^ I s"-e— p(ds|X*) (11) 



for i = 1, . . . , nijc). Indeed, the posterior distribution, given X, of the CRM ji defining 
an NRMI (4) is a mixture with respect to the distribution of the latent variable J7„. 

Specifically, /iC^-^) = fi^^-^^ +Y,l=i jf^'^^^x^, where: /i^^") is a CRM with intensity 

v^^-\ds,dx)=e-^-'p{ds\x)a{dx)] (12) 

the X*'s are the fixed points of discontinuity; the are the corresponding jumps, 

which are mutually independent and independent from fl^'^"\ and whose density is given 

by 

/if"'^)(s)(xs-e-^"V(ds|x;). (13) 

See [23], Theorem 1, for details. 

We are now in a position to provide the exact posterior distribution of means of an 
NRMI. Note that the results hold for any function g and any NRMI (identified by means 
of its Poisson intensity (2)) which lead to an a.s. finite mean (6). In what follows, we 
agree to denote by Im(z) and Rc(z) the imaginary and real parts, respectively, of the 
complex number z. Moreover, and J^"'"^'' are the Laplace exponent of the CRM 
defined by (12) and the jumps whose density is given by (13), respectively, with Un = u. 
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Theorem 3.1. Let P be an NRMI. The posterior distribution of P{g), given X, is then 
absolutely continuous with respect to the Lebesgue measure on ffi and a posterior density 
function is given by 



(14) 



R-e{xg(t,CT)}di, ifn = \, 

/a poo 
/ [{<J-z)tr-'lm{xg{t,z)}dtdz, ifn^2p, 
-oo Jo 

/ [{a~z)tr'^Rc{xg{t,z)}dtdz, ifn^2p+l, 

oo Jo 



where p>l, 



(15) 



lj = L ■ "J \ I J, 

K„^.(ii[g(X;) -z]|X;) = /+°°t-"^e"''(5(^.')-^V(dw|X;) and T„^iu\X*) is as m (11), for 
j = 1, . . . , n{-K). Moreover, the posterior cumulative distribution function of P{g), given 
X, admits the representation 



F^(cr:o) = --- lim 



C,g{(T;u,t) f^^{u) dudt, 



whe 



(g{a;u,t) ~Ini{c" 



V.("'(-it(g-al))gj^itE"il'(9(x;)-<^)4"'''' 



]} 



(16) 



(17) 



and fif'^ is the density of the latent variable [/„ given in (10). 



3.2. Means of mixtures of NRMI 



Before dealing with the posterior distribution of means of mixtures of NRMI, it is worth 
recahing that a popular way of representing (5) is as a hierarchical mixture: 

Yi\Xi^''^'^' k{-,Xi), i^l,...,n, 

X,\P P, (18) 

where {Xi)i>i is a sequence of latent variables with values in X and ^ is the law of the 
NRMI P. The notation adopted in the description of model (18) is standard in Bayesian 
statistics and means that, given P, the observations Yi are independent and identically 
distributed with random density /(y) = fc(y, x)P{dx). Now, since the background driv- 
ing NRMI is a.s. discrete, it will induce ties among the latent variables, which are denoted, 
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as previously, by X* , . . . , X*^^^ and tt = {Ci , . . . , C„(^) } indicates the corresponding par- 
tition of the integers {1, . . . ,n} with Cj = {i : Xi = X*}. This notation ahows us, given 
a partition, to keep a record of which latent variable each of the observations Y^, for 
i = 1, . . . , n, is assigned to. 

Here, we aim to derive the posterior distribution of Q{g), given Y. By virtue of (7), 
this can also be seen as the distribution of J^h{x)P{dx), given Y. The next theorem 
yields an exact expression for a density of such a posterior distribution. It is worth noting 
that the results hold for any function g, k and NRMI which load to a finite mean (8): 
the expressions are given in terms of h (which is in turn, defined as in (7)), k and the 
Poisson intensity (2) corresponding to any NRMI. 

Theorem 3.2. Let f be a mixture of NRMI, as in (5). The posterior distribution of 
Q{g), given Y, is then absolutely continuous with respect to the Lebesgue measure on M 
and the corresponding posterior density function is given by 



(P'icT-^g) 
where 



Rc{a(t,fT)}dt, 

/ [(<7-z)t]"-ilm{a(i,z)}didz, 

-oo Jo 

p<7 p<7 poo 

(-l)M / / [{'J-zWRe{Ut,z)}dtdz, 



— OO J —oo Jo 



ifn^l, 
ifn^ 2p, 
ifn^2p+l, 



-) Il^ec, k{Y^,x)a{dx)] 



e-^(ni) [^^ Y\;L1' /x n.ec, ^)t„, iu\x)aidx)] du 



(19) 



(20) 



having set h{x) = g{y)k{y,x)X{dy) and Kn^ {it[h{x) ~ z]\x) ~ J^"^"" ti"jc'*'"(''('^^ p{dv\x) 
for j = 1, . . . ,ri(7r). Moreover, the posterior cumulative distribution function of Q{g), 
given Y, can be represented as 



11 1 
'(c^;.9) = 7^ - -^lim / -lui{Chi<y\t)}dt, 

Z 7T T-i+oo Jq t 



(21) 



where p > 1 , 



^„_l^_^(_it(g_Crl)+ul) 



5] n / E(e"(^(^)-^i)-^i"'"') n k{Y,,x)T^^{u\x)a{d2 
(j j."-ic-'^("^) Eli / W k{Y,,x)TnM^)a{dx) dw) 



du (22) 
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where the jumps Jj' 's have density given by (13). 

4. Applications 

Let us now consider, in detail, two specific cases of statistical relevance, involving two 
important CRM's, namely, generalized gamma CRM [2] and extended gamma CRM [9]. 
Both have been found to have many applications in survival analysis, spatial statistics, 
mixture models and spatio-temporal models. 



4.1. Extended gamma NRMI 

Here, we consider NRMI's derived from extended CRM's, which are characterized by the 
intensity 

g-/3(£c)« 

j/(d?j, dx) = dv Q;(dx), 

V 

where /3 is a positive real- valued function. The corresponding NRMI in (4), to be termed 
an extended gamma NRMI with parameters («(•),/?(•)), is well defined if a and /? are 
such that /jjlog(l + t[j3[x)]~^)a{dx) < +oo for every t>0. 

As for distributional properties of means of extended gamma NRMI's, from (6), it 
follows that P{g) is finite if and only if J^log(l + tg{x)[l3{x)]~'-^)a{dx) < +oo for every 
t>0, which coincides, except for the factor of non-homogeneity /3, with the well-known 
condition given in [11] for the Dirichlet process. Moreover, the prior distribution of a 
mean is given by 

W{a;g) = --- / i lin(e- /x i°s(i-i*[/'(-)l-'(9(-)-)Md-)) dt, 

2 TtJo t 

having applied Proposition 2 in [40] and shown that F is absolutely continuous with 
respect to the Lebesgue measure on R. We now provide expressions for the posterior 
density function and posterior distribution function of a mean of an extended gamma 
NRMI. 

Proposition 4.1. Let P be an extended gamma NRMI. The posterior density function 
of P(g), given X, is then of the form (14) with 

g- log{f!{x)-it{g{x)-z))a^{dx) 

where a^{-) '■= cti') + '^^IX^ i^i^X'i')- Moreover, the posterior cumulative distribution 
function is given by 

X 1 /„°°/o°°t-iw"-ilm{c-i;.i°g('3(-)+"-i*(f(-)--))"^(d-)}d?/dt 
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It is worth noting that the expressions in Proposition 4.1 are surprisingly simple, 
given the fact that they are exact distributions of functionals of a non-conjugate prior. 
Indeed, in terms of complexity, they are no more involved than the known expressions 
for Dirichlet means [4, 5, 39]. For illustrative purposes, suppose n = 1 and let [3{x) = 
(3i1a{x) + P2^A''{x), where /3i > /32 > and In this case, it can be easily seen 

that the normalizing constant in coincides with 

f+co -1 a-ai{A} ol-aiiA") / o \ 

where, for simplicity, we have set ai := af- and 2F1 is the Gauss hypergeometric function. 
Hence, by resorting to (14), one has that a posterior density of P{g), given Xi, evaluated 
at z G [0, 1] coincides with 

a(X) /Jj'^^^^/Sf 



7T 2i^i(ai(A),l;a(X) + l;l-/32//3i) 



-f 00 



cxp^ -i / log[l3^{x)+t^{g{x)-zy]ai{dx) } (23) 



/ f t(g(x)-z) , , A , 
X cos / arctan --— aiidx) at. 



It is worth emphasizing that, since this is a density function, one obtains an integral 
representation for the hypergeometric function 2-Fi(ai(^), 1; a(X)-|- 1; 1—^2^1^) in terms 
of g, that is, 

2i^i(^«i(A),l;a(X) + l;l-^ 



^ a(X)^„,(A)^ai(A-)-l f f °° ^-l/2f^ los[P^{x)+t^{g{x)-zf]a,{dx) 

^ ^ Jo Jo 

{ f t(g(x)-z) A , , 

X cos / arctan — ai[dx)]ataz. 

\Jx P(x) J 

If we further suppose that g = 1a, then (23) provides a posterior density for P{A), 
given Xi. In order to obtain a simplified expression, suppose that A and a are such 
that a{A) = aiA") = 1 and Xi G A so that ai(A) = 2 = 1 + aiiA"). The corresponding 
posterior density for P{A) is then of the form 



2 



2l3f 

2Fi(2, 1; 3; 1 - /32//3i) [Piz + /32(1 - z)]^ ^l^'^^^'^' ^^4) 

Details of the determination of (24) are given in the Appendix. Also, note that 
2Fi(2,l;3;l - P2/M = 2/3? /^^[/^iz + (32{1 - z)]-^ dz = 2pl\og{P,/ ^Pi - ^2)-^ - 
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2[3i{pi — /?2) ^ ■ Since the (prior) probability distribution function of P{A) is 
F(a; 1^) = (-) + 1(1,^) (-) 

and the corresponding density is /3i/32[/3iO' + fh{'^ — o')]^^l[o.i](o')i one finds out that 
having observed Xi G A, the probabiUty mass of P{A) in a neighborhood of decreases, 
while it increases in a neighborhood of 1. The opposite phenomenon is observed when 
Xi is not in A. 

We now move on to considering posterior distributions of means of mixtures of the 
extended Gamma NRMI. Recall that, in such a case, the data Y are i.i.d., given Q, 
where Q is the random probability measure corresponding to the mixture density (5) 
driven by an extended gamma NRMI. As shown in [33], a mean functional Q{g) is finite 
if and only if for every t > 0, one has /jjlog(l + th* {x)[l3{x)]~^)a{dx) < +oo, where 
h*{x) — ^^\g{y)\k{y^x)\{dy). The prior distribution function is then of the form 

F(a;5) = i- - / ilm(e-/xi°g(i-i*(M-)-)[/5(-)l-'Md-))dt, 

with h{x) = Jy \g(jj)\k{y,x)X{dy). The next proposition provides the corresponding pos- 
terior density and cumulative distribution function. For both, we provide two expressions: 
the first is in line with Theorem 3.1, whereas the second exploits a sort of quasi-conjugacy 
peculiar to gamma-like models (see also Remark 4.1 in Section 4.2). 

Proposition 4.2. Let Q be the random probability measure associated with the random 
density (5) driven by an extended gamma NRMI. The posterior density function of a 
mean Q{g), given Y, is then of the form (19), with 

^h{t,z) 



("j - / e-'°s('5(")-'*(''(")-"»"^- Yl kiY„x)a{dx) (25) 

TT j=l •'^ zGCj 




X 7T / u e 



n-1 - f^log{f!{s)+u)aids) 



n(7r) ^ 1 

^ \ " 

TT j=l ''■'^ ieCj 



flK -l)' / e-'°s('^(=^)+")"' Yi k{Y„x)a(dx)du 

■K j = l 

or, alternatively, as (19), with 



(26) 
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w/iere a^(-) = a(-) + ^"^^^ rijJa;^ anrf, as previously, h{x) = J^g{y)k{y,x)X{dy). More- 
over, the posterior distribution function can be represented as (21), with 

^n-l^- J^\og(u+P{s)-it{h{s)-a))a{ds) ^27) 
n(7r) 



U 

+ 



J2 IlK-l)' / e-'°s("+'^(^)-'*(''(^)-'^»"^- Y[ k(Y„x)a{dx)du 



X 



X I 7T / W C 



n(7r) 



X 

7r i=l ""^ ^eCj 



or, alternatively, as (21) with 



"-ie-ii°g("+'3('^)-i*(''(-'s)-'^))"? dw 
Y[k{Y,,Xj)aidx,) (28) 



li e 



X 



X Yl k(Y,,Xj)a{dxj) 

As a corollary of Proposition 4.2, we obtain the main result of ([33], Theorem 1), namely 
the posterior density of means of the mixture of Dirichlet process model introduced in [32] . 
Note that there is a slight inaccuracy in the expression for this density in Corollary 1 of 
[33], which should coincide with the one below. Moreover, we also obtain a representation 
for its posterior cumulative distribution function, which is new. 
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Corollary 4.1. Let Q be the random probability measure associated with the mixture of 
Dirichlet process. The posterior density function of a mean Q{g), given Y, is then given 
by 

= i—x 1 (29) 

where 

p"(a;fe)= ° + "~^ / Re{e-Aiog(i-iW^)-))"?(d«)}di (30) 

is the posterior density of P{h), with P being the Dirichlet process and, as before, a^{-) = 
a(-) + '^j'^^j ■ Moreover, the posterior cumulative distribution function of Q{g), 

given Y, can be represented as 

C (cr;o =- 7—. , (31) 

2 ^E.n;iT^K-i)!/xn.ec,M>^.,^)«(dx) 

with C{a;h) = i Im{c- /xi°g(i-i*(''('^)-'^))"? (d's)} dt being the posterior cumulative 
distribution function of a Dirichlet mean P{h). 



4.2. Generalized gamma NRMI's 

Let us now consider the generalized gamma NRMI, which is based on the CRM with 
intensity 

^(d«'d2^) = f(Y^^di;a(da;), (32) 

where 7 € (0, 1) and r > 0. This class can be characterized as the exponential family 
generated by the positive stable laws (see [2. 36]). It includes the stable CRM for t = 0, 
the inverse Gaussian CRM for 7 = 1/2 and the gamma CRM as 7 — !> 0. Note that the 
resulting NRMI, termed a generalized gamma NRMI, is well defined if and only if a is 
a finite measure. Before proceeding, recall that E[P(-)] = ~ ^o(')j where a := a(X), 
and Pq is usually referred to as the prior guess at the shape of P. 

Now, from (6), and by noting that ipig) — Jxi''' ~^ di^))"' '^i'^^) ~ t'^o,, it follows imme- 
diately that P{g) is finite if and only if 

{t + t\g{x)\ya{dx) <oo for any <> 0. (33) 

Henceforth, we consider functions g such that (33) holds true. In the following proposi- 
tion, we provide expressions for the prior distribution of P{g). 
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Proposition 4.3. Let P be a generalized gamma NRMI. The cumulative distribution 
function can then be expressed as 

2 n Jq t 

where /? = ar'^ > 0, or, also, as 

1 „0 roo 1 

n<^-,9) = l^-- -c-^^^^'hm{l3By{t))dt, (35) 

2 n Jo t 

where 

Aait) = y [1 +t^(.g(a;) - cr)^]^/^ cos{7arctaii[t(,9(.T) - a)]}Po{dx), (36) 
B^(t)= / [l+<^(.g(2;) -cr)V^^sin{7arctan[i(5(a;) -cr)]}Fo(dx). (37) 



Let us now turn our attention to posterior quantities. The next result provides both 
the posterior density and the posterior cumulative distribution function of the mean of 
a generalized gamma NRMI. 

Proposition 4.4. Let P be a generalized gamma NRML The posterior density function 
of P{g), given X, is then of the form (14), with 

""'^'^"^^ -E;roM"7^)(-i)^-/?^/-rK..)-,77i) ' ^''^ 

where r(-; •) stands for the incomplete gamma function. Moreover, the posterior cumula- 
tive distribution function can be written as 

F^(a;.g) 



00 poo 

t-'u^'-'lmi e- 



"'0 



n(7r) 1 

xY[[l + u- itig{X*) - a)]^-"^ \ dudt 



n-l 



X |^7r(7/3"(-))-^^^ (^"^. ^^(-l)^/?^"/'^r(n(7r)-j77,/3)j . 

Remark 4-1- At this point, it is useful to compare the expressions arising for extended 
gamma NRMFs and for generalized gamma NRMI's. As for the latter, by looking at (38) 
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and at the imaginary part of the distribution function of the posterior mean in Propo- 
sition 4.4, one can easily identify the characteristic functional of the generalized gamma 
CRM times a product of n(7r) terms, each of these being the characteristic function of a 
gamma random variable. Now, each term is clearly associated with a distinct observation 
X* and the number of times rij it has been recorded. Moreover, the precise expression 
arises by taking the derivative of the characteristic functional (which corresponds to ob- 
serving a certain value for the first time) and then by taking the (rij — l)th derivative of 
the resulting expression (which corresponds to observing repetitions of this observation) . 
This structure is apparent in the expressions in Proposition 4.4 and, indeed, it is the idea 
of seeing the observations as derivatives which inspires the proof of Theorem 3.1. Turn- 
ing our attention to the extended gamma NRMI which is an extension to the Dirichlet 
process, one still derives the posterior as derivatives, but, then, due to the important 
relation 



one can adjoin the derivatives (observations) to the characteristic functional, getting a 
sort of quasi-conjugacy, which is a proper conjugacy property if and only if the NRMI 
is the Dirichlet process. By conjugacy, one usually refers to the fact that the posterior 
distribution is of the same form as the prior with updated parameters. Here, by quasi- 
conjugacy, we refer to the fact that, although conjugacy itself does not hold, it does hold 
conditionally on some latent variable. 

We are now in a position to provide the posterior distribution of means of a random 
mixture density (5) driven by a generalized gamma NRMI. Before stating the result, we 
introduce the notation {a)b = r(a + b)/T{a) for the Pochhammer symbol. 

Proposition 4.5. Let Q be the random probability measure associated with the random 
density (5) driven by a generalized gamma NRMI. The posterior density function of a 
mean Q{g), given Y, is then of the form (19), with 



r(ni)e"-ti°s(i+"(/'(^))^')"('i^) 



(u + /3(2/,))-"' 



(39) 



ih{t,z) 



1^-0 S^(l-it(h(x)-z))-< Po(dx) 



n 



X 




TT j = l 




(40) 



X 
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where h{x) = ^^\g{y)\k[y,x)\{(ly). Moreover, the posterior distribution function can be 
represented as ( 21 ), with 

n;'i"^ (1 - l)n,-i /x[l - ~^)+ ^l^-"-- U^ec, HY^, x)aidx) du 

4-2.1. Connections with the two-parameter Poisson-Dirichlet process 

In this section, we point out a connection between a generalized gamma NRMI and 
the celebrated two-parameter Poisson-Dirichlet process due to Pitman [35]. The latter 
random probability measure has found interesting applications in a variety of fields such 
as population genetics, statistical physics, excursion theory and combinatorics; for details 
and references, see [37]. A recent systematic study of functionals of the two-parameter 
Poisson-Dirichlet process is provided in [22]. However, some interesting new results follow 
from the treatment of the generalized gamma NRMI given above. 

Let us first recall the definition of a two-parameter Poisson-Dirichlet random prob- 
ability measure. Suppose that pL^ is the 7-stable CRM arising from (32) when t = 0. 
Let £2^ denote its probability distribution on (M, ,^(M)) and ^.y,g be another probabil- 
ity distribution on (M,^(M)) such that ^-^^g < and {d^.ffi/d^^){^i) = (//(X))"®, 
where 9 > —7. If jl~f^g is the random measure whose distribution coincides with ^^^g, one 
then defines the two-parameter Poisson-Dirichlet process as P^^g = fi^ £ / fi^ . Wc will 
now show how to exploit the fact that P^£{g) can be obtained as a suitable mixture of 
means of generalized gamma NRMI's in order to obtain a new representation for the 
distribution function of P^^g{g). 

Denote by Z a gamma random variable with shape parameter 0/7 > and scale pa- 
rameter /3 > 0, that is, 

/.(.) = (42) 

Let jlz be a CRM, independent of Z, with intensity measure obtained by multiplying 
the Levy measure corresponding to the generalized gamma CRM in (32) by Z , that is, 

Z7 e""^^ 
vz (dz;, dx) = ^ dz; a{dx) , 

and define Pz = flz / ftz{^)- By (34), we can set t = 1 without loss of generality. See 
also [36], Section 4. It can then be shown that for any function g:X^ R+ satisfying the 
intcgrability condition (33), one has 

+00 

nPM<x]fz{z)dz^P[P^,g{g)<x]. (43) 
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The above equality in distribution follows by noting that 

for any lo £ C such that | arg(a;)| < 7t, where the first expected value is computed with 
respect to the product measure of the vector {Pz,Z). Using this connection, one can 
exploit the representation for the distribution of Pz{g) given in Proposition 4.3 in order 
to deduce a new. surprisingly simple, expression for the probability distribution of P^,g(g), 
which can be compared with alternative representations given in [22]. 

Proposition 4.6. Let g be a function for which (33) holds true. The cumulative dis- 
tribution function of the mean of a two-parameter Poisson-Dirichlet process P-f.e{g) can 
then be represented as 

11 r sm{Oh^^ct^^BAt)/A,{t)) 

where and are defined in (36) and (37), respectively. 

As an interesting consequence of the representation in (43), we obtain the finite- 
dimensional distributions of the two-parameter Poisson-Dirichlet process with 7 = 1/2 
and > 0, which were first obtained in [3], Theorem 3.1. Before stating the result, it is 
worth noting that if 7 = 1/2, the generalized gamma CRM reduces to an inverse Gaussian 
CRM and that, consequently, the finite-dimensional distributions of the two-parameter 
Poisson-Dirichlet process are obtained as mixtures with respect to those of the inverse 
Gaussian NRMI. 

Proposition 4.7. For any partition of X into sets ^1, . . . , An £ ^ such that Po(^i) = 
Pi > for any i, a density function of the random vector (Pi/2^g{Ai), . . . , P]^/2,e(^n-i)) 
on the simplex A„_i — {{wi, . . . , Wn-i) G [0, 1]"^"'^ : Wi <1} is given by 

, (n:ii ne + n/2) w-^/^ ■ ■ ■ w-%\i - tz' mr^/' 

J[Wi,. . .,Wn-l) — 



n{n-i)/2 T{d + l/2) [An{wu...,Wn-l)]''+"/^ 

where An{wi, . . . ,w„_i) = J27=i Pf'^i^ +pI{^ ^ J27=i 

Remark 4-2. Two interesting distributional properties of Dirichlet means can be read- 
ily extended to means of species sampling models, which include homogeneous NRMFs 
and the two-parameter Poisson-Dirichlet process as special cases [36]. Recall that 
a species sampling model is defined as an a.s. discrete random probability measure 
P(-) = J2i>iPi^Xi{-) such that the pi's (weights) are independent from the X^'s (lo- 
cations), which are i.i.d. from some non-atomic distribution Pq. The first property we 
consider is related to the symmetry of the distribution of P{g). If P is a Dirichlet pro- 
cess, conditions for symmetry have been investigated, for example, in [13, 39]. If P is a 
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species sampling model, it can be shown that, analogously to the Dirichlet case, P{g) is 
symmetric if the distribution of g{X), where X is a random variable with distribution 
Pq, is symmetric. Another distributional property of a mean of a Dirichlet process was 
considered in [43], where the author proves that Pq o has the same distribution as 
P{g) if Po o g~^ is Cauchy distributed. By mimicking the proof of [43], one can easily 
show that such a property holds true when P is any species sampling model. 

5. Concluding remarks 

In this final section, we briefly discuss two further issues, namely the concrete imple- 
mentation of the results provided in the paper (also in relation to simulation schemes) 
and frcqucntist asymptotics. As for the former, wc note that in the numerical compu- 
tation of the integrals involved in the distributional formulae for means of NRMI's, the 
integrands arc typically well behaved, which, in some sense, is quite natural, given the 
"mean-operation" has a smoothing effect: hence, for specific cases, exact expressions are 
computable with numerical double-integration packages; see, for example, [39, 40]. For 
mixtures of NRMI, numerical computation becomes problematic since sums over par- 
titions are involved, which increase the computing time at an exponential rate. As an 
alternative, one could also resort to simulation schemes to obtain an approximation of the 
distribution of interest. However, the simulation of realizations of an NRMI is a delicate 
task, since it is based on CRM's which jump infinitely often on any bounded interval. 
Consequently, a simulation algorithm is necessarily based on some truncation, which, 
in some cases, may compromise posterior inferences; see, for example, [10]. This implies 
that the availability of exactly computable expressions for distributions of means is also 
useful for practical purposes as a benchmark for testing the accuracy of a simulation 
algorithm: one simulates trajectories of the process (by tuning the number of jumps and 
the number of trajectories) until a suitable distance between the exact and the simulated 
distribution of the mean is less than a prescribed error. If this is the case, the numerical 
output can then be exploited to compute any quantity of interest (not just means). 

In order to evaluate frequentist asymptotic properties of Bayesian procedures, the 
data are assumed to be independently generated by a "true" distribution Ptr- As far as 
consistency is concerned, one essentially has that NRMI's are consistent if Ptr is discrete 
and inconsistent if Ptr is non-atomic (with the exception of the Dirichlet process). This 
can be informally illustrated by looking at, for instance, the 7-stable NRMI: the predictive 
distributions associated with such an NRMI are of the form 

P(X„+i G -IX*,. . . ,X,:(^)) = ^Po(-) + - E ("^ - ^)^A7(-)- (45) 

i=l 

If the data are generated independently from a non- atomic Ptr, sdl observations are dis- 
tinct, which implies that n{7v) ~ n almost surely. Hence, as the sample size diverges, the 
predictive distribution converges weakly, almost surely, to 7P0 + (1 — 7)Ptr and E[P((7)|X] 
converges to ')Po{g) + (1 — l)Ptr{g)- Consequently, the posterior distribution of P{g) is 
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inconsistent unless Pq = Ptr- On the other hand, if Ptr is discrete, then hnin(7r)/n = 
ahiiost surely and E[P(gr)|X] converges to the "true" value Ptrig)] see [21, 24] for general 
results in this direction. Inconsistency with respect to a continuous Ptr is not really prob- 
lematic since, for modelling continuous data, one would naturally use an NRMI mixture 
rather than an NRMI itself and, in such a case, we have consistency under mild assump- 
tions [30]. A related issue of interest is the validity of Bernstein-von Mises theorems, 
which provide information about asymptotic normality of the posterior distribution and 
asymptotic equivalence with respect to MLE estimation. For the nonparamctric case, 
only a few results, both positive and negative, are currently known; see [16] for a stimu- 
lating account of the topic. The derivation of such results can be a formidable task since 
the posterior representation of the random probability measure is explicitly involved. 
This implies that no result is known, even for the mixture of Dirichlet process. Recently, 
in [21], a Bernstein-von Mises- type theorem has been obtained for the two-parameter 
Poisson-Dirichlet process in the case of continuous Ptr. Under the same assumptions as 
in Theorem 3.1 of [21], we can then deduce the asymptotic behavior for the posterior 
distribution of a mean functional of the two-parameter Poisson-Dirichlet process, de- 
noted by P^g{g)- Indeed, we obtain that as the number of observations generated by an 
absolutely continuous Ptr diverges, 

n'/HP^e{9)-nP^e{9)]} 

An(0, (1 -7)[^^tr(5') - [Ptr(5)]'] +7(1 -7)[Po(.9') - [Poig)?] 
+ 7(Ptr(g)-Po(.g))') a.s., 

where N(/Lt, a^) stands for a Gaussian random variable with mean yu and variance cr^. In 
particular, if g = 1^, then the asymptotic behavior of P^g{A) is obtained, that is, the 
centered and rescaled distribution of P^g (A) converges to a mean zero Gaussian random 
variable with variance (1 — 7)[p(l — p)] + 7(1 — 7) [9(1 — q)] + l{p — qf , having set p := 
Ptr (A) and q := Pq{A). In order to derive such results for general NRMFs, an approach 
similar to [21] seems advisable, although the technicalities may become overwhelming. 
This may well represent a topic for future research. 

Appendix 

Proof of Theorem 3.1. In order to derive a representation of the posterior density, 
we start by discrctizing P according to the procedure of Regazzini and Sazonov [41]. It 
essentially consists of discretizing the random probability measure and the sample along 
a tree of nested partitions of X which, at level m, is made up of sets Bm.i, ■ • ■ , Pm,fc„+i 
with Bm,k^+i = X \ (|J^=i -Bm.i) such that Brn,k^+i i and maxi<i<fc^ dimn{Bm,i) 
as m tends +00, where diam(i?) is the diameter of B. The discretized random mean, at 

level m of the tree, will be of the form Pm{g) = '^'j^i'^ g{bm.j)P{Bm.j), where b^j is any 
point in Bm,j for j = 1, . . . , km + 1- Moreover, denote by the marginal distribution 
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of X with respect to P,„. Whenever the jth distinct element, x*, lies in Bm.i, it is as 
if we had observed bm,i', see [41] for details. Note that whatever tree of partitions has 
been chosen, there always exists an m* such that for every m > m* , the n(7r) distinct 
observations within the sample fall in n(7r) distinct sets of the partition. Given such 
a discretization. Propositions 3 and 4 in [40] easily extend to the case of NRMI's over 
Polish spaces. Thus, the discretized posterior mean converges, in the sense of almost sure 
weak convergence, to the actual posterior mean. If the observations X are such that n^^. 
of them lie in Bm,sj , for j = 1, . . . , n(7r), an expression of the posterior density function 
of the discretized mean Pm{g) is given by 



(-1)"+! 9" 

~s ', ^ 

(46) 



^m(X) . . . 



X /" ¥{a;rmfi,---,rrn,k,„+i] 



(r,„,i,...,r,„,fc„ + i) = (/(6,„,i),...J(&„,fc„ 



where F stands for the prior distribution of the discretized mean and 1^+ h{a) = 

la ^"(n-\)! is the Liouville-Weyl fractional integral^ for n>l, whereas repre- 

sents the identity operator. We now move on to the identification of the limiting density. 
To this end, set oimj = a{Bm.j), for j = 1, . . . , km+i, and rewrite (46) as 

(47) 

where A is defined as 

By virtue of the diffuseness of a, one has 

p + co 

Cl^M) = (it)"-''a™.., / z;"='e'*''('^'"-<-"V(d«|6™..,) +o(a„,,,J 
Jo 

as m ^ oo. In order to complete the proof, one needs to resort to the expression of the 
marginal distribution of the observations provided in Proposition 4 of [23] . Hence, if we 
let m tend to +oo and apply Theorem 35.7 in Billingsley [1] and dominated convergence, 
the desired result follows. Note that, as a by-product, we have also proven that the 
posterior distribution of the means is absolutely continuous with respect to the Lebesgue 
measure on R. 

The proof of the representation of the posterior cumulative distribution function in 
(16) consists of the following steps. First, we use the idea suggested by [13], that is, 

F^(a;g)=P{/i(5-al)<0|X}. (48) 
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As in [40], we now resort to Gurland's inversion formula, which, combined with Propo- 
sition 2 in [40], yields 

F^(a;.g) = i - 1 lim T ^ Im{E[c'*«^--i) JX]} dt. 

2 7T Tf+cx) Jq t 

The conclusion can now be easily deduced from [23], Theorem 1, according to which 

"'0 



□ 



Proof of Theorem 3.2. The first thing to note is that by (7), determining the distribu- 
tion of ((5(g) I Y) is equivalent to determining the distribution of {P{h)\Y) with h{x) = 
Jy g{y)k{y,x)X{dy). Moreover, by linearity of the mean, one has {P{h)\Y) = J^h{x) x 
{P{dx)\Y). Thus, we need a posterior representation of the NRMI, given the data Y 
which come from the mixture of NRMI in (5). To this end, one can adapt Theorem 2 in 
[18] to obtain the disintegration 

(PJY) I f (PlX).^(dXlY), (49) 

where ^(X]Y) stands for the marginal distribution of the latent variables X, given 
the observables Y. From (49) combined with Theorem 3.1, a first description of the 
distribution of {Q{g)\Y) follows. Indeed, the posterior density can be represented as 

(/)^(cr;.g)= J p^{a;h)^{dX\Y), (50) 

where p^{(7;h) coincides with the density function given in Theorem 3.1 with h{x) ~ 
Jy g{y)k{y,x)X{dy). Moreover, the posterior distribution function is of the form 

G^(a;5)= J F^(ct; /i).^(dX| Y), (51) 

where ¥-^{a;h) is given in Theorem 3.1, with h defined as above. Note that the pre- 
vious expressions could also have been obtained by combining the discretization of the 
observation space employed in [33] with the limiting arguments used in the proof of 
Theorem 3.1. 

In order to get explicit descriptions of the posterior distributions in (50) and (51), 
we need to find expressions for the marginal distribution of X given Y. From Bayes' 
theorem, it follows that 

,^(dxiv). ,nj.,^w^x) 

Jx" Ilt=ik{Yt,Xi)^{dX) 

It is clear that the marginal distribution of X can be described by the joint distribution of 
the distinct variables X* and the induced random partition tt. In Proposition 4 of [23], an 
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expression for the joint distribution of (X*,7r) is provided. By inserting this formula into 

(52) , suitably rearranging the terms and making use of the partition notation introduced 
at the beginning of Section 2, one obtains 

.5(dxiY) - ^(^" .»-ie--^(") ITS ("1-^;) 

^"-ie-^(-) U;LV /x n.ec, (u|x)a(d:r)] d^ ' 

where stands for the sum over partitions and the Tn- 's arc defined in (11). Inserting 

(53) into (50) and carrying out suitable simplifications, one obtains the desired expression 
for the posterior density. As for the posterior distribution function, insert (53) into (51) 
and, after some algebra, the result follows. □ 

Proof of Proposition 4.1. The proofs of both representations arc based on Theo- 
rem 3.1 and on the key relation (39) discussed in Remark 4.1 of Section 4.2. As for 
the posterior density function, note that r„^. (w|X*) = T{nj)[(3{X*) + and that 

Kn^{it[giX*) - z]\X*) =r{njmx*) - itigiX*) - z)]-"^ for j = 1, . . . , n(7r). Inserting 
these expression into the general one of Theorem 3.1 and carrying out the appropri- 
ate simplifications using (39) allows the posterior density function of P{g) to be writ- 
ten as desired. With reference to the posterior cumulative distribution function, one 
has that, given X and C/„, the jth jump jj*^"'-^^ is gamma distributed with parame- 
ters {f3{X*) + Un, Uj), for j = 1, . . . , n{'7v). Hence, we have Cg{cr; tt, t) = exp[— log(l — 
it{g{x) — a)[f3{x) + u]~-^)a* {dx)] for (17). Finally, it is easy to verify that t^^(^g{a;u,t) is 
absolutely integrable in (M, -|-oo) for any M > and u > 0. Thus, appropriate simplifi- 
cations between denominator and numerator lead to the result. □ 

Details for the determination of (24). Given g ~ 1a and a{A) = ctiA"^) = 1, the 
integral in (23) can be written as 



cos(2 arctani(l — z)//3i — arctante//3; 



2) 



[/32+t2(l_^)2][^2+i2^2]l/2 

1 

Wl+t^ii-z^M+t^z^]^/^ 



dt 



{ 2( ^(1-^) 

< COS arctan — 

I \ Pi 



tz\ . t{l-z)\ ( tz 
COS I arctan — I — sm arctan — I cos arctan — 

P2/ V Pi / V P2 



„ . / t(l~z)\ ( t{\~z)\ . / tz^ I , 

2 sm arctan cos arctan sm arctan — \> dt 



\ Pi J \ Pi J \ P 

and it can be easily seen that it reduces to 



/o [l3f + t^{l-z)V2+t^^'V^' 
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t\l-zf h t{l-z)P, tz 

Pl+t^l~zY ^/?2+t2^2+ pl+t^i^l-zf ^0r^Wz^] 

e{l~z){2l3^z-p.2{l-z))+PlP2 

(1 - Z)mz - m - -))(l_l)4,2 4^^/(1 _,)(^^"l_,)+^^/,)2 
, 7T/32/32 



2(1 - Z)4z2 ^ /32(/32/(l _ - /3|/z2)2 

1 1-Z 



2(^2/(1 - Z)2 - /3|/z2)/33/(l _ ^)3 (^2/(1 _ _ /32/^2)2^^ 
7t[2/3iZ + /32(l-z)] , 7t[2/33z3-3/32/?2z2(i_^)^^3(i_^)3] 



4/3i[/?iZ + /32(l-z)]2 4/3i[/32z2_^2(i_^)2 

nz 



[/3iz + /32(l-z)]2' 

from which (24) foUows. □ 

Proof of Proposition 4.2. The representations in (25) and (27) are obtained by a 
direct application of Theorem 3.2: one only needs to compute the quantities already 
exploited in the proof of Proposition 4.1 and insert them into the relevant expressions 
of Theorem 3.2. In order to obtain the posterior representations in (26) and (28) which 
make use of the quasi-conjugacy of the extended gamma process, it is enough to apply 
Fubini's theorem and use (39). □ 

Proof of Corollary 4.1. Recall that the Dirichlet case corresponds to the extended 
gamma case with P[x) = c > 0, which, without loss of generality, we can set equal to 1. 
We first derive the posterior density function. To this end, consider (26) and note that 
in this case, within the denominator, one has 

^n^lg-log(l+.)(a+n)^^^ r(a)r(7l) ^ ^^^^ 

+ T{a + n) 

having set a{%) := a. This allows (26) to be rewritten as 

r(a + n)e- A ^os(f}(s)-it{h{s)~z))afjds) 



7tr(a)r(n) 



(55) 
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which, combined with (20) and an application of Fubini's theorem, leads to a posterior 
density of the form 

(a;g) = , (56) 

where is the posterior density of P{h), given X, as given in [23], Theorem 1. But, 
since the posterior mean of a Dirichlet process is again a Dirichlet process mean with 
updated parameter measure , one can choose the simplest possible expression for such 
a density, which, to date, is (30) obtained in [40]. Hence, can be replaced by (30) in 
(56); and (29) follows. Note that (30) can also be derived from the posterior density in 
Proposition 4.1 by considering the case n = 1, for which (54) yields a~^. To obtain (31), 
one proceeds in a similar fashion, starting from (28), and then simplifies the resulting 
expression using the representation for the cumulative distribution function of Dirichlet 
process mean provided in [39]. □ 



Proof of Proposition 4.3. We start by noting that 

P(P(g)<^T)=P(/i(r[.g-al])<0) (57) 

and then apply Gurland's inversion formula to the right-hand side of (57). Showing that 
F is absolutely continuous (with respect to the Lebesgue measure on R), one has that the 
prior distribution of P{g) is of the form (34), with /3 = ar'^ > 0. Note that, thanks to the 
rcparametcrization induced by (57), the generalized gamma NRMI is completely specified 
by the parameters Pq: 7 a-nd /3. In order to make (34) completely explicit, introduce the 
quantities (36) and (37) and observe that 

exp|-/3^[l - it{g{x) - a)]^Po(da;)| = exp{-[/3A,(i) - i/3B.(t)]}. 

This implies that (34) can be represented, by working out the imaginary part, as (35). □ 



Proof of Proposition 4.4. In order to obtain the expressions for the posterior distribu- 
tion, we resort to Theorem 3.1, but we have to take into account the reparametrization 
in (57): hence, we set, without loss of generality, r = 1 and a{dx) = l3Po{dx) in (32). 
Consequently, we get t„^. (m|X*) = 7[1 + 7/]^""^ (1 - 7)„^._i and Kn. {it{g{X*) - z)\X*) = 
7[1 — it{g{X*) — z)]^~^^ (1 — 7)„^_i, for j = 1, . . . , n(iT), where, as before, (a)f, stands for 
the Pochhammer symbol. The explicit form of the denominator is obtained by a suitable 
change of variable combined with the binomial theorem. Some algebra leads to an ana- 
lytic representation of the posterior density of P{g) as in (14) with the function Xg as in 
(38). This proves that first part of the proposition. In order to derive the posterior cumu- 
lative distribution function given in (16), first note that the jumps j^^'"-^") arc gamma 
distributed with scale parameter [/„ -I- 7 and shape parameter — 7. Then, write the 
explicit form of the other quantities involved and carry out some simplifications. It is 
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then easy to see that ({a; u, t) is absolutely integrable in (M, +c>o) for any M > and 
u > 0, thus, the result follows. □ 

Proof of Proposition 4.5. The result is obtained by exploiting the quantities computed 
for deriving Proposition 4.4 and inserting them into the expression of Theorem 3.2. This, 
combined with some algebra, leads to the formulae in (40) and (41). □ 

Proof of Proposition 4.6. Let ¥^{-]g) denote the cumulative distribution function of 
the mean Pz{g), given Z. By Proposition 4.3, ¥^{-;g) coincides with (34), with z/3 in 
place of /?. Reasoning as in the proof of Proposition 4.3, wc obtain 

^''i<^;9) = 7; / -e-^'^^'(*)sin(z/?i3,(t))d<, 

2 n Jq t 

where A^r and Bo- are defined as in (36) and (37), respectively. We now integrate with 
respect to Z in order to obtain the unconditional cumulative distribution function of the 
two-parameter Poisson-Dirichlet process, that is, 

F(a;,) = --^^y^ dt-j^ //-V^^-^"(*)sin(z/^B.(i))d. 
1 IS''/"' sm{0/jaTctanB^{t)/A„{t)) 



2 7T Jo i[/?2A2(t)+/32B2(i)]e/(27) 

where the last equality follows from 3.944.5 in [15]. Simplifying with respect to /? leads 
to the desired result. □ 

Proof or Proposition 4.7. From Proposition 1 in [29], the density function of the 
random vector {Pz{Ai), . . . , Pz{An-i)) with 7 = 1/2 is given by 

/ (wi,...,W„_i) = 2»/2-l7T"/2^ ' K_ri/2{Z(5V An{wi, . . . ,Wn-l)) 

/ n-1 \ -3/2 

1' 1 - E {{Zf3)'An{wu. ■ . ,z«„-i)}-"/^ 



-3/2 -3/2 
X Wi ■ ■ - W. 



If one integrates the above density with respect to fz in (42) and makes use of 6.561.16 
in [15], the result easily follows. □ 
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